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Coronavirus budding at the endoplasmic reticulum-Golgi intermediate compartment (ERGIC) requires 
accumulation of the viral envelope proteins at this point in the secretory pathway. Here we demonstrate that 
the spike (S) protein from the group 3 coronavirus infectious bronchitis virus (IBV) contains a canonical 
dilysine endoplasmic reticulum retrieval signal (-KKXX-COOH) in its cytoplasmic tail. This signal can retain 
a chimeric reporter protein in the ERGIC and when mutated allows transport of the full-length S protein as 
well as the chimera to the plasma membrane. Interestingly, the IBV S protein also contains a tyrosine-based 
endocytosis signal in its cytoplasmic tail, suggesting that any S protein that escapes the ERGIC will be rapidly 
endocytosed when it reaches the plasma membrane. We also identified a novel dibasic motif (-KXHXX-COOH) 
in the cytoplasmic tails of S proteins from group 1 coronaviruses and from the newly identified coronavirus 
implicated in severe acute respiratory syndrome. This dibasic motif also retained a reporter protein in the 
ERGIC, similar to the dilysine motif in IBV S. The cytoplasmic tails of S proteins from group 2 coronaviruses 
lack an intracellular localization signal. The inherent differences in S-protein trafficking could point to 
interesting variations in pathogenesis of coronaviruses, since increased levels of surface S protein could 
promote syncytium formation and direct cell-to-cell spread of the infection. 


Most enveloped viruses assemble at the cytoplasmic face of 
the plasma membrane and bud out of the cell (reviewed in 
reference 12). The envelope proteins of these viruses are syn¬ 
thesized in the secretory pathway and accumulate at the 
plasma membrane. However, some enveloped viruses assem¬ 
ble intracellularly, obtaining their lipid envelope from intracel¬ 
lular, membrane-bound compartments. These viruses bud into 
the lumens of intracellular compartments and exit the cell by 
exocytosis. For example, flaviviruses assemble at the endoplas¬ 
mic reticulum (ER) (9, 31, 34), coronaviruses assemble at the 
ER-Golgi intermediate compartment (ERGIC) (27), and bunya- 
viruses (35) and rubella virus (22) bud into the Golgi. The 
envelope proteins of viruses that assemble in intracellular com¬ 
partments possess signals that direct them to the site of viral 
assembly (reviewed in reference 18). These signals mimic those 
used by endogenous cellular proteins and utilize cellular ma¬ 
chinery for localization. 

The first ER localization signal for a membrane protein was 
identified in the adenovirus E3-19K protein (24, 37). This 
signal consists of lysine residues at the —3 and —4 (or —5) 
positions relative to the C terminus (51). Dilysine signals were 
subsequently shown to direct retrieval of escaped proteins 
from post-ER compartments back to the ER. Proteins with the 
dilysine signal bind the coatomer complex (COPI) and are 
recruited into vesicles that travel in a retrograde direction 
relative to the ER (8, 13). The efficiency of binding to COPI is 
influenced by the sequence context surrounding the dilysine 
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signal, which contributes to steady-state localization of pro¬ 
teins bearing this signal to the ER, ERGIC, or Golgi complex 
(51). The envelope glycoprotein from the retrovirus human 
foamy virus also contains a dilysine signal (15,16). This dilysine 
signal directs budding of this virus into intracellular compart¬ 
ments (14). Other types of targeting signals have been identi¬ 
fied in envelope proteins of viruses that assemble at the 
ERGIC or Golgi, although the mechanism by which they work 
is not understood (reviewed in reference 21). 

Coronaviridae are members of the Nidovirales order and 
contain a positive-strand RNA genome ranging from 27 to 31 
kb in size (47). Coronaviruses are classified into groups 1, 2, or 
3 by sequence homology (17) and infect a wide range of ver¬ 
tebrate species. Their cellular tropism also varies, since differ¬ 
ent coronaviruses infect the gastrointestinal tract, respiratory 
tract, and nervous system. The recent emergence of the coro¬ 
navirus that causes severe acute respiratory syndrome (SARS) 
has focused a great deal of interest on coronaviruses (23). 

Coronaviruses contain three envelope proteins: envelope 
(E), membrane (M), and spike (S). The E protein is present in 
low levels in the mature virion but plays a critical role in viral 
assembly (10, 28, 41). M is the most abundant protein in the 
viral envelope and is important for virus maturation, interact¬ 
ing with E, S, and the nucleocapsid during assembly (40, 49, 
53). When expressed together from cDNA, the coronavirus E 
and M proteins interact and form virus-like particles (5, 6, 53). 
The S protein is less abundant than M in virions and is respon¬ 
sible for binding and fusion to host cells (reviewed in reference 
11 ). 

We study the group 3 coronavirus infectious bronchitis virus 
(IBV) as a model for intracellular assembly at the ERGIC. 
IBV E contains a Golgi targeting signal within its cytoplasmic 
tail (4). IBV M contains a Golgi targeting signal located in its 
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first transmembrane domain (33, 50). Thus, IBV M and E 
move past the virus assembly site when expressed individually, 
and it is not yet known how the viral envelope proteins are 
collected together in the ERGIC in infected cells. 

When S proteins from different coronaviruses are exog¬ 
enously expressed, a large portion remains intracellular (54). 
Slow folding of the large lumenal domain in the ER could 
contribute to this localization (39). Here we demonstrate a 
canonical dilysine ER retrieval signal located at the C terminus 
of IBV S. This dilysine signal is sufficient to retain IBV S 
intracellularly and retains a chimeric protein containing the 
cytoplasmic tail of IBV S in the ERGIC. We also discovered a 
novel dibasic signal in group 1 coronavirus and SARS S pro¬ 
teins that is similar to the dilysine signal in IBV S. These two 
types of signals are likely to contribute to the localization of S 
protein at the virus assembly site. Our observations support the 
hypothesis first proposed by Vennema et al. (54) suggesting 
that coronavirus S proteins contain intracellular retention sig¬ 
nals. 


MATERIALS AND METHODS 

Cells and transfection. HeLa cells were maintained in Dulbecco’s modified 
Eagle’s medium containing 10% fetal calf serum and antibiotics. Transient trans¬ 
fections were performed using 2 |xg of pcDNA3.1/Hygro (Invitrogen, Grand 
Island, N.Y.) encoding the appropriate cDNA and 6 pi of LT-1 transfection 
reagent (Mirus, Madison, Wis.) following the manufacturer’s instructions. Ex¬ 
pression was analyzed at 18 h posttransfection, except for pcDNA3.1/S and 
pcDNA3.1/S 2 A transfections, which were analyzed at 3 days posttransfection. 

Plasmids and expression vectors. We cloned the cDNA encoding IBV S by 
reverse transcription-PCR) using RNA from Vero cells infected with the Vero- 
adapted strain of IBV (33). The full-length cDNA was assembled in pBluescript 
(Stratagene, La Jolla, Calif.) and sequenced by dideoxy sequencing at the Johns 
Hopkins DNA Synthesis and Sequencing Facility. The cloned Vero-adapted IBV 
S gene contained 14 amino acid changes compared to the published Beaudette 
IBV S sequence (2). Twelve of these amino acid changes (Leu 12 2 to lie, Leu 130 
to Phe, Asn 261 to Thr, Lys 330 to Asn, Ile 336 to Val, Lys 364 to Ser, Asn 421 to His, 
Ile 474 to Val, Asn 683 to Thr, Lys 689 to Arg, Asn 709 to Asp, and Phe 723 to Leu) 
were found in other strains of IBV, while two unique amino acid changes were 
Asn 842 to Asp and Ser 1(n2 to lie. For transient transfection in HeLa cells, the 
full-length cDNA encoding IBV S was subcloned into pcDNA3.1/Hygro utilizing 
the Xbal and Apal sites. The dilysine located at the C terminus of the S protein 
was mutagenized using the QuikChange site-directed mutagenesis kit (Strat¬ 
agene, La Jolla, Calif.), generating the PCDNA3.1/S2A mutant signal (Lys 1159 to 
Ala and Lys 1160 to Ala). 

The G-S constructs were generated by introducing a BamHI site by 
QuikChange site-directed mutagenesis at the end of the S transmembrane do¬ 
main at Gly 1119 . The resulting protein, S XMB , contained one amino acid change 
(Cysnzo to He), but this did not affect normal trafficking of the S protein (data 
not shown). The S cytoplasmic tail coding sequence was then subcloned in place 
of the G cytoplasmic tail in pcDNA3.1/G TMB (42), using the BamHI and Xhol 
sites, generating pcDNA3.1/G-S and pcDNAS.l/G-S^. Mutation of the putative 
tyrosine internalization signal (Tyr 1143 to Ala and Tyr 1144 to Ala) located up¬ 
stream of the dilysine signal (see Fig. 1A) was performed using the QuikChange 
site-directed mutagenesis kit in the pcDNA3.1/G-S 2A plasmid to generate 
pcDNA3.1/G-S 4A . To generate the pcDNA3.1/G-S BCV construct, sense and an¬ 
tisense 42-mer oligonucleotides encoding the C-terminal 11 amino acids of 
bovine coronavirus (BCV) S (GenBank accession no. P25193) flanked by Aatl 
and Apal restriction sites were annealed in a solution containing 100 mM 
potassium acetate, 30 mM HEPES-KOH (pH 7.4), and 2 mM magnesium ace¬ 
tate and phosphorylated with T4 DNA kinase. A silent Aatll site was introduced 
11 amino acids before the C terminus of the pcDNA3.1/G-S by QuikChange 
site-directed mutagenesis to create pcDNA3.1/G-S(AatII). A DNA fragment 
encoding the last 11 residues of IBV S was excised from pcDNA3.1/G-S (Aatll) 
with Aatll and Apal, and the BCV tail sequence was inserted, generating 
pcDNA3.1/G-S BCV . The same procedure was used to generate other G-S chi¬ 
meric proteins containing the last 11 amino acids of coronavirus S proteins from 
transmissible gastroenteritis virus (TGEV) (GenBank accession no. NP_058424), 
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FIG. 1. Mutation of the C-terminal dilysine signal promotes trans¬ 
port of IBV S to the cell surface. (A) The sequence of the IBV S 
cytoplasmic tail, with the KKXX motif at the C terminus. Alanine 
residues (indicated with arrows) replace the dilysine motif in the S 2 a 
mutant. A potential endocytosis signal (YYTF) is indicated in italics. 
(B) Intact HeLa cells expressing wild-type IBV S or S 2 A were stained 
with mouse anti-IBV S at 4°C and then fixed, permeabilized, and 
stained with rabbit anti-IBV S for internal expression. Secondary an¬ 
tibodies were fluorescein-conjugated donkey anti-rabbit immunoglob¬ 
ulin G (IgG) and Texas Red-conjugated goat anti-mouse IgG. Bar, 10 
p,m. 


murine hepatitis virus (MHV) strain A59 (GenBank accession no. VGIH59), and 
Urbani SARS virus (GenBank accession no. AAP72986). 

The pcDNA3.1/G-Ssars2a construct was generated by mutagenizing the 
codons for His 1158 and Lys 1161 of pcDNA3.1/G-S SARS to alanines by 
QuikChange site-directed mutagenesis. The same procedure was used to con¬ 
struct pcDNA3.1/G-S XGEV 2A from pcDNA3.1/G-S XGEV . 

Antibodies. The mouse monoclonal anti-IBV S antibody (9B1B6) recognizes 
the lumenal domain and was a kind gift of Ellen Collisson (Texas A&M Uni¬ 
versity, College Station, Tex.) (55). The rabbit polyclonal anti-IBV S antibody 
was generated from a bacterial fusion protein consisting of a His 6 tag fused to the 
cytoplasmic tail of IBV S. The IBV S-tail sequence was subcloned into pRSET 
expression vector C (Invitrogen, Grand Island, N.Y.) and transformed into 
Escherichia coli BL21(DE3)/pLysS (Invitrogen). Expression was induced for 1 h 
with 100 mM isopropyl-(J-D-thiogalactopyranoside. The fusion protein was puri¬ 
fied on nickel-agarose beads (Qiagen, Valencia, Calif.), following the manufac¬ 
turer’s instructions. The rabbit anti-IBV S tail antibody was specific for IBV S in 
immunoprecipitation experiments (data not shown). The mouse anti-vesicular 
stomatitis virus glycoprotein (VSV-G) monoclonal antibody used in immunoflu¬ 
orescence recognizes the lumenal domain of VSV-G (29). For immunofluores¬ 
cence, rabbit polyclonal affinity-purified anti-VSV-G was also used (45). For 
immunoprecipitation of radiolabeled VSV-G and G-S proteins, a rabbit antibody 
to whole VSV was used (56). The mouse monoclonal anti-ERGIC-53 antibody 
was a kind gift of H. P. Hauri (Basel, Switzerland). Mouse anti-GM130 was from 
Transduction Labs (San Jose, Calif.), and rabbit anticalnexin was from StressGen 
(San Diego, Calif.) 

Indirect immunofluorescence microscopy. HeLa cells were plated on cover- 
slips in 35-mm-diameter dishes 1 day before transfection and transfected as 
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described above. At 18 h posttransfection (or 3 days for IBV S), cells were fixed 
in 3% paraformaldehyde in phosphate-buffered saline (PBS) for 10 min at room 
temperature, permeabilized with 0.5% Triton X-100 for 3 min, and stained as 
previously described (50). For staining with anticalnexin, cells were permeabil¬ 
ized with 0.1% saponin instead of Triton X-100. To label surface IBV S or G-S 
chimeric proteins, intact cells were washed with ice-cold PBS and incubated with 
mouse anti-IBV S or anti-VSV-G for 10 min at 4°C. Cells were then washed two 
times with ice-cold PBS and fixed and permeabilized as described above. Cells 
were then stained for intracellular proteins with either rabbit affinity-purified 
anti-VSV-G or rabbit anti-IBV S tail as described above. Live cell antibody 
uptake was performed by incubating HeLa cells expressing G-S chimeric proteins 
with mouse anti-VSV-G (0.5 (xg/ml in culture medium) for 15 min at 37°C. Cells 
were then fixed and permeabilized as described above and stained for internal 
protein with rabbit affinity-purified anti-VSV-G. Images were acquired with a 
Zeiss (Thornwood, N.Y.) Axioscop microscope equipped for epifluorescence 
with a Sensys charge-coupled-device camera (Photometries, Tucson, Ariz.), using 
IPLab software (Scanalytics, Vienna, Va.). 

Metabolic labeling, immunoprecipitation, and glycosidase digestion. HeLa 
cells were transfected as described above and at 16 h post transfection were 
starved for 20 min with methionine- and cysteine-free medium at 37°C. The cells 
were then labeled with 50 uCi of 35 S-Promix (Amersham Pharmacia Biotech, 
Piscataway, N.J.) per ml in methionine- and cysteine-free medium for 15 min at 
37°C. Cells were chased in unlabeled medium for 0, 15, 30, or 60 min. After a 
rinse in ice-cold PBS, the cells were lysed in a detergent solution (62.5 mM 
EDTA, 50 mM Tris-HCl [pH 8.0], 0.4% deoxycholate, 1% NP-40) containing 
protease inhibitors. After removal of nuclei and debris by centrifugation at 
16,000 X g for 3 min, the lysate was incubated at 37°C for 20 min with 2 pi of 
rabbit anti-VSV. Antibody-antigen complexes were collected by incubation with 
fixed Staphylococcus aureus (Pansorbin; Calbiochem, San Diego, Calif.) for 15 
min at room temperature. Immunoprecipitates were washed in RIP A buffer 
(0.1% sodium dodecyl sulfate [SDS], 50 mM Tris-HCl [pH 8.0], 1% Na-deoxy- 
cholate, 150 mM NaCl, 1% Triton X-100) and subsequently eluted in 20 pi of 50 
mM Tris-HCl (pH 6.8)—1% SDS by incubation at 100°C for 3 min. An equal 
volume of 0.15 M sodium citrate, pH 5.5, containing 0.4 mU of recombinant 
endoglycosidase H (endo H) (New England Biolabs, Beverly, Mass.) was added, 
and samples were incubated for 16 h at 37°C. The samples were then subjected 
to SDS-polyacrylamide gel electrophoresis (PAGE) using 10% gels, and oligo¬ 
saccharide processing was quantified on a Molecular Imager FX phosphorimager 
(Bio-Rad, Hercules, Calif.) by comparing band densities of endo H-sensitive and 
-resistant forms, using Quantity One software (Bio-Rad). 


RESULTS 

Mutation of the putative dilysine signal in IBV results in 
surface expression. Vennema et al. (54) showed that corona- 
virus S proteins expressed exogenously were transported 
through the secretory pathway more slowly than S proteins 
in infected cells. They proposed that coronavirus S proteins 
might contain intracellular retention signals that would be 
masked after virus assembly. Examination of the Beaudette 
IBV S protein sequence indicated a putative ER retrieval 
signal in its cytoplasmic tail, consisting of lysine residues at 
the —3 and —4 positions from the C terminus (Fig. 1A). 
These two lysines are conserved in S proteins from eight 
different strains of IBV whose sequences are available in 
GenBank, as well as in turkey coronavirus. To determine the 
contribution of the putative dilysine signal to the localiza¬ 
tion of IBV S, we expressed the wild-type protein and a 
mutant where both lysines were replaced by alanines. Tran¬ 
siently transfected HeLa cells were examined by indirect 
immunofluorescence microscopy (Fig. IB). Wild-type S was 
not detected at the plasma membrane, and it accumulated in 
intracellular compartments resembling the ER and ERGIC 
(see below). The mutant lacking the dilysine signal (S 2A ) 
was readily detected at the cell surface as well as throughout 
the secretory pathway. Thus, the dilysine signal plays a role 
in accumulation of the IBV S protein near the virus assem¬ 


bly site. However, the low levels of expression of the IBV S 
protein that were obtained from transfection of cDNA made 
further study of this targeting signal difficult in the context 
of the full-length S protein. 

Targeting information in the IBV S cytoplasmic tail retains 
a reporter protein intracellularly. To study the targeting signal 
in the IBV S cytoplasmic tail, we asked if this domain could 
retain a reporter protein intracellularly. VSV-G is a well-stud¬ 
ied plasma membrane protein that has the same membrane 
topology as the IBV S protein (a type I membrane protein). 
We replaced the cytoplasmic tail of VSV-G with that of IBV S 
to generate the chimeric protein G-S (Fig. 2A). HeLa cells 
were transiently transfected with plasmids encoding VSV-G, 
G-S, or G-S 2A (with the dilysine signal replaced with alanines) 
and were examined by indirect immunofluorescence micros¬ 
copy (Fig. 2B). As expected, VSV-G exhibited significant sur¬ 
face expression with internal staining throughout the secretory 
pathway. G-S was absent from the plasma membrane in most 
cells and was instead localized to intracellular compartments. 
A low percentage of transfected cells (~13%) expressed G-S 
on the cell surface (data not shown). These cells appeared to 
express higher levels of G-S than cells that did not have de¬ 
tectable protein at the cell surface. It is likely that the machin¬ 
ery that recognizes the intracellular localization signal can be 
saturated (52). 

Replacing the two lysine residues with alanines (G-S 2A ) re¬ 
sulted in expression on the cell surface with a patchy distribu¬ 
tion. Internal staining of cells expressing G-S 2A demonstrated 
that this protein was largely localized in the Golgi region and 
numerous small puncta throughout the cytoplasm. Upon closer 
examination of the IBV S-tail sequence, we observed a poten¬ 
tial tyrosine internalization signal (YYTF) (Fig. 1A) (reviewed 
in reference 3) upstream of the dilysine signal. Replacing the 
two tyrosine residues in this sequence with alanines (G-S 4A ) 
resulted in homogeneous plasma membrane distribution, with¬ 
out the puncta seen with G-S 2A (Fig. 2B). 

Steady-state localization of G-S chimeric proteins. To 
determine the intracellular localization of G-S, transiently 
transfected HeLa cells were double labeled with antibodies 
recognizing various resident proteins localized in different in¬ 
tracellular compartments (Fig. 3A). The localization of G-S 
partially overlapped those of the ER resident protein calnexin 
and the Golgi protein GM130. However, the distribution of 
G-S most strongly overlapped with that of ERGIC-53, indicat¬ 
ing that G-S largely accumulated in this intermediate compart¬ 
ment (Fig. 3A, color merge panels). The steady-state localiza¬ 
tion of G-S in the ERGIC instead of the ER may reflect the 
contribution of the sequence surrounding the dilysine signal, as 
shown for the ERGIC-53 protein (1). The full-length IBV S 
protein, however, showed more ER staining than G-S (com¬ 
pare Fig. IB and 3A). This difference in steady-state localiza¬ 
tion probably reflects the slow folding of the large lumenal 
domain of coronavirus S proteins in the ER (39). By replacing 
the lumenal domain of IBV S with that of VSV-G, we were 
able to explore the contribution of the dilysine signal to tar¬ 
geting in the absence of slow folding and exit from the ER. 

Unlike G-S, G-S 2A was localized in distinct puncta that 
could represent endocytotic structures following internaliza¬ 
tion from the plasma membrane (Fig. 2B). To test this idea, 
live cells expressing G-S 2A or G-S 4A were incubated for 15 min 
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FIG. 2. The IBV S cytoplasmic tail retains a reporter protein in- 
tracellularly. (A) The G-S chimera contains the lumenal head (black) 
and transmembrane domain (gray) from VSV-G and the cytoplasmic 
domain from IBV S (white; sequence shown in Fig. 1A). G-S 2 A has the 
dilysine signal mutagenized to alanines residues, and G-s 4A has both 
the upstream tyrosine internalization signal and the dilysine signal 
mutagenized to alanine residues. (B) Intact HeLa cells expressing 
VSV-G, G-S, G-S 2 A; or G-S 4A were stained with mouse anti-VSV-G at 
4°C, fixed, permeabilized, and then stained with rabbit anti-VSV-G for 
internal expression. Secondary antibodies were fluorescein-conjugated 
donkey anti-rabbit IgG and Texas Red-conjugated goat anti-mouse 
IgG. Bar, 10 p,m. 


at 37°C with mouse anti-VSV-G antibodies, as described in 
Materials and Methods. After fixation and permeabilization, 
cells were stained with rabbit anti-VSV-G for detection of the 
total G-S 2A pool (Fig. 3B). The intracellular puncta containing 
G-S 2A were labeled by exogenously added antibody (Fig. 3B, 
color merge), indicating that at least a portion of G-S 2A was 
efficiently endocytosed after reaching the plasma membrane. 


Mutation of the tyrosine residues in G S 4A demonstrated that 
the YYTF sequence was the likely internalization signal, since 
G-S 4A was evenly distributed at the cell surface and little anti- 
VSV-G antibody was internalized. Note that the constructs 
described below containing the C-terminal residues of other 
coronavirus S proteins all have the IBV S tyrosine internaliza¬ 
tion motif. 

Mutation of the dilysine signal in G-S results in efficient 
trafficking through the Golgi complex. During biosynthesis, 
VSV-G obtains two N-linked oligosaccharides in the ER. The 
oligosaccharides are processed in the medial Golgi, where they 
become resistant to digestion with the enzyme endo H. Thus, 
the rate of trafficking through the Golgi can be inferred from 
the rate of acquisition of endo H resistance. FleLa cells ex¬ 
pressing VSV-G, G-S, G-S 2A , or G-S 4A were pulse labeled with 
[ 35 S]methionine-cysteine for 15 min and chased for various 
times in medium containing unlabeled amino acids. The ex¬ 
pressed proteins were immunoprecipitated and subjected to 
endo FI digestion (Fig. 4A). Note that G-S chimeric proteins 
are slightly larger than VSV-G and run accordingly on SDS- 
PAGE. VSV-G, G-S 2A , and G-S 4A proteins rapidly became 
resistant to endo H during the chase. However, only a small 
portion of G-S acquired endo H resistance by 60 min. Quan¬ 
titation of four similar experiments showed that VSV-G, 
G-S 2A , and G-S 4A all move through the Golgi at similar rates, 
whereas G-S is mostly retained in a compartment prior to the 
medial Golgi (Fig. 4B). The low rate of processing of G-S 
could reflect the low percentage of cells that exhibit surface 
staining for this protein. Given the steady-state localization 
and slow processing of G-S, we conclude that the dilysine 
sequence serves as a functional ER retrieval signal for IBV S. 

Group 2 coronaviruses lack an intracellular localization 
signal in their cytoplasmic tails. The cytoplasmic tails of group 
2 coronaviruses showed no obvious signal or motif that would 
lead to their sequestration within intracellular compartments. 
To test this directly, we replaced the last 11 amino acids of the 
G-S chimera with the final 11 amino acids from BCV or MHV 
strain A59 to generate G-S BCV and G-S MHV , respectively (Fig. 
5A). We chose to swap the last 11 residues in G-S in case a 
targeting signal existed in this sequence and because a unique 
restriction site could be introduced at this position with a silent 
mutation. HeLa cells were transfected with cDNAs encoding 
G-S 2A G-S bcv , and G-S MHV and assayed by indirect immu¬ 
nofluorescence microscopy. G-S BCV and G-S MHV were ex¬ 
pressed on the cell surface, with distinct internal staining re¬ 
sembling the puncta seen with g-s 2A (Fig. 5B). Pulse-chase 
analysis of G-S BCV and G-S MHV demonstrated that oligosac¬ 
charide processing in the Golgi occurred at rates comparable 
to that of G-S 2A (Fig. 5C). Thus, unlike the group 3 corona- 
virus IBV, the C-terminal 11 amino acids of BCV S and MHV 
S lack signals for localization in the early secretory pathway. 

S proteins of group 1 coronaviruses and the SARS corona- 
virus possess a novel intracellular localization signal. TGEV 
is a group 1 coronavirus (17), while the recently identified 
SARS coronavirus has been reported to be distantly related to 
group 2 coronaviruses (48). Examination of the cytoplasmic 
tail sequences of the S proteins from TGEV and SARS iden¬ 
tified a unique motif, KXHXX, at their C termini (Fig. 6A). S 
proteins from other group 1 coronavirus (human coronavirus 
229E, canine coronavirus, and feline infectious peritonitis vi- 
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FIG. 3. G-S is retained mostly in the ERGIC, and mutation of the dilysine motif results in transport to the plasma membrane and endocytosis. 
(A) HeLa cells expressing G-S were fixed, permeabilized, stained with either rabbit or mouse anti-VSV-G, and double labeled with either rabbit 
anticalnexin, mouse anti-ERGIC-53, or mouse anti-GM130. For cells double labeled with calnexin, the secondary antibodies were fluorescein- 
conjugated donkey anti-rabbit IgG and Texas Red-conjugated goat anti-mouse IgG. For cells double labeled with ERGIC-53 or GM130, secondary 
antibodies were fluorescein-conjugated goat anti-mouse IgG and Texas Red-conjugated donkey rabbit IgG. (B) Intact HeLa cells expressing G-Sia 
and G-S 4A were incubated with mouse anti-VSV-G at 37°C for 15 min, fixed, permeabilized, and then stained with rabbit anti-VSV-G for internal 
expression. Secondary antibodies were fluorescein-conjugated donkey anti-rabbit IgG and Texas Red-conjugated goat anti-mouse IgG. Bars, 10 
pan. 


rus) also contain this sequence at their C termini. The 
KXHXX motif fits the criteria for a dibasic motif (51) if the 
histidine residue is protonated. We postulated that this poten¬ 
tial dibasic signal could act similarly to the dilysine signal in 
IBV S. To test this hypothesis, we replaced the final 11 amino 
acids of G-S with the corresponding sequence of the SARS or 
TGEV S proteins, generating the G-S sars and G-S xgev con¬ 
structs, respectively. 

Transiently transfected HeLa cells expressing G-S sars and 
G-S tgev were examined by indirect immunofluorescence mi¬ 


croscopy (Fig. 6B). The majority of cells lacked surface expres¬ 
sion, with intracellular staining in a post-ER compartment. A 
low percentage of cells expressing G-S sars and G-S tgev (~11 
and ~6%, respectively) had detectable surface expression 
(data not shown). These were cells with overall high levels of 
expression, similar to our findings with G-S. The intracellular 
distribution of G-S sars and G-S tgev most closely overlapped 
with that of ERGIC-53 (Fig. 6C). These data suggest that the 
KXHXX motif is an intracellular localization signal similar to 
the dilysine signal in IBV S. 
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FIG. 4. G-S is rapidly trafficked through the Golgi after mutation 
of the dilysine signal. (A) HeLa cells expressing VSV-G, G-S, G-S^ 
or G-S 4A were pulse labeled with [ 35 S] methionine-cysteine for 15 min, 
chased for the times indicated, lysed, and immunoprecipitated with 
anti-VSV-G antibody. The immunoprecipitates were treated with endo 
H and subjected to SDS-PAGE. The endo H-resistant and -sensitive 
forms are indicated. (B) Quantitation of oligosaccharide processing 
rates. These data represent the averages from four independent ex¬ 
periments, with the error bars representing the standard deviations. 


Mutagenesis of the lysine and histidine residues in the 
KXHXX signal results in loss of ERGIC localization. To test 
the hypothesis that the KXHXX motif functions as a genuine 
localization signal, the histidine and lysine residues in G-S sars 
and G-S tgev were mutagenized to alanines to construct the 
G-S SARS2A and G-S tgev2a chimeras (Fig. 7A). HeLa cells 
transiently transfected with G-S 2A , G-S SARS2A , or G-S tgev2a 
were examined by indirect immunofluorescence microscopy 
(Fig. 7B). G-S SARS2A and G-S TGEV2A were efficiently trans¬ 
ported to the surface, with internal puncta similar to those for 

g-s 2A . 

Oligosaccharide processing of G-S sars , G-S tgev , 
G-S SARS2A , and G-S tgev2a (Fig. 8) confirmed that mutation 
of the lysine and histidine residues in the cytoplasmic tails of 
G-S sars and G-S tgev abrogated intracellular retention. Both 
G-S SARS2A and G-S TGEV2A obtained endo H resistance at 
rates similar to those for VSV-G and G-S 2A , whereas only low 
rates of processing occurred with chimeric proteins containing 
the KXHXX motif. 

These results point to a novel intracellular localization signal 
(KXHXX) at the C terminus of type I membrane proteins. 


A 


B 


C 




FIG. 5. The C-terminal 11 amino acids of S proteins from group 2 
coronaviruses lack intracellular localization signals. (A) The final 11 
amino acids of G-S were swapped with those from the BCV or MHV- 
A59 S proteins (underlined). Note the lack of any apparent dibasic 
motif at the —3 and —4 or —5 position from the C terminus in the BCV 
and MHV sequences. (B) Intact HeLa cells expressing G-S 2 a> G-S bcv , 
or G-S mhv were stained with mouse anti-VSV-G at 4°C, fixed, per- 
meabilized, and then stained with rabbit anti-VSV-G for internal ex¬ 
pression. Secondary antibodies were fluorescein-conjugated donkey 
anti-rabbit IgG and Texas Red-conjugated goat anti-mouse IgG. Bar, 
10 p,m. (C) HeLa cells expressing G-Sja, G-S bcv , or G-S MHV were 
pulse labeled with [ 35 S]methionine-cysteine for 15 min, chased for 0, 
15, 30, or 60 min, lysed, and immunoprecipitated with anti-VSV-G 
antibody. The immunoprecipitates were treated with endo H and sub¬ 
jected to SDS-PAGE. Endo H resistance was quantitated for three 
independent experiments (error bars represent standard deviations). 


Further experiments are required to determine if proteins that 
posses this motif use the same ER retrieval pathway followed 
by proteins with the KKXX signal. Overall, our findings dem¬ 
onstrate that S proteins from groups 1 and 3 and the SARS 
coronavirus contain signals near their C termini that specify 
intracellular localization near the virus assembly site. 

DISCUSSION 

The purpose of this study was to explore the targeting and 
accumulation of coronavirus S proteins at the site of virus 
assembly. We report that a dilysine signal (KKXX- COOH ) con- 
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4 4 
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FIG. 6. Group 1 coronaviruses and SARS S proteins contain intra¬ 
cellular localization signals in their cytoplasmic tails. (A) The final 11 
amino acids of TGEV S or SARS S (underlined), which contain pu¬ 
tative dibasic signals, were swapped for the same residues of G-S. 
(B) Intact HeLa cells expressing G-S sars or G-S tgev were stained 
with mouse anti-VSV-G at 4°C, fixed, permeabilized, and then stained 
with rabbit anti-VSV-G for internal expression. Secondary antibodies 
were fluorescein-conjugated donkey anti-rabbit IgG and Texas Red- 
conjugated goat anti-mouse IgG. (C) EleLa cells expressing G-S, G- 
Ssarsi or G-S tgev were fixed, permeabilized, and stained with rabbit 
anti-VSV-G and mouse anti-ERGIC-53. Secondary antibodies were 
fluorescein-conjugated goat anti-rabbit IgG and Texas Red-conjugated 
donkey anti-rabbit IgG. The insets show enlargements of the boxed 
regions, with the anti-VSV-G panel on top and the anti-ERGIC-53 
panel below. Bars, 10 p.m. 


tributes to the localization of IBV S near the virus assembly 
site. The cytoplasmic tail of IBV S retained a plasma mem¬ 
brane reporter protein (VSV-G) in the ERGIC and reduced 
the rate of oligosaccharide processing in the Golgi complex. 
We also studied the cytoplasmic tails of S proteins of other 





FIG. 7. Mutagenesis of the histidine and lysine residues in G-S sars 
and G-S tgev results in transport to the plasma membrane. (A) The 
histidine at —3 and the lysine at —5 were replaced by alanines (G-S SARS2 a 
and G-S TGEV2A ). (B) Intact HeLa cells expressing G-S^, G-S SARS2A , or 
G~Stgev 2 a were stained with mouse anti-VSV-G at 4°C, fixed, perme¬ 
abilized, and then stained with rabbit anti-VSV-G for internal expression. 
Secondary antibodies were fluorescein-conjugated donkey anti-rabbit IgG 
and Texas Red-conjugated goat anti-mouse IgG. Bar, 10 p,m. 


coronaviruses. The C-terminal 11 amino acids of the S protein 
from the group 1 coronavirus TGEV as well as the SARS 
coronavirus contained a signal similar to the dilysine motif 
(KXHXX- cooh ). Like the KKXX motif, the KXHXX se¬ 
quence specified intracellular localization of the G-S chimera 
in the ERGIC. However, S proteins from group 2 coronavi¬ 
ruses (including BCV and MHV) lacked such signals in this 
region of their cytoplasmic tails. Here we discuss how these 
signals may play a role in mediating the localization of coro¬ 
navirus S proteins to the site of virus assembly. 

A canonical dilysine signal is present at the carboxy termi¬ 
nus of IBV S. We found that when transiently expressed, the 
IBV S protein was localized to the ER and ERGIC. Mutation 
of the C-terminal dilysine signal resulted in readily detectable 
surface expression. The dilysine signal could be responsible for 
localizing IBV S near the site of virus assembly and could serve 
to limit surface expression. Both IBV M and E are transported 
past the virus assembly site when expressed from cDNA (7, 
32). Presumably, interactions between IBV S, M, and E result 
in localization of the three proteins at the ERGIC during virus 
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FIG. 8. Mutation of the histidine and lysine residues in G-S sars 
and G-S tgev results in rapid trafficking through the Golgi complex. 
HeLa cells expressing G-S sars G-S tgev , G-S SARS2 a> or G-S XGEV2 a 
were pulse labeled with [ 35 S]methionine-cysteine for 15 min, chased 
for 0, 15, 30, or 60 min, lysed, and immunoprecipitated with anti- 
VSV-G antibody. The immunoprecipitates were treated with endo H 
and subjected to SDS-PAGE. Endo H resistance was quantitated 
(three independent experiments, with the error bars representing the 
standard deviations). 


assembly. It is interesting that all three of the IBV envelope 
proteins possess targeting signals that maintain their localiza¬ 
tion near the ERGIC, although all three signals are different. 
It is perhaps most efficient to retain the three envelope pro¬ 
teins individually near the assembly site. 

To fully explore the contribution of the dilysine signal to 
targeting, we studied it independently of the full-length IBV S 
protein. Replacing the cytoplasmic tail of a plasma membrane 
reporter protein (VSV-G) with the cytoplasmic tail of IBV S 
resulted in ERGIC localization of the G-S chimera. When the 
two lysine residues were mutagenized to alanines, surface ex¬ 
pression of G-S 2A was readily detected and trafficking through 
the Golgi occurred at a rate similar to that of VSV-G. At very 
high expression levels, G-S was transported to the plasma 
membrane. It is likely that the retrieval machinery can be 
saturated, as shown for the H/KDEL signal for soluble ER 
resident proteins (52). 

Recognition of the dilysine signal. The dilysine signal con¬ 
sists of lysines located at the —3 and —4 or —5 position from 
the carboxy terminus (24) and is recognized by the COPI 
coatomer complex (8, 20). Our preliminary experiments sug¬ 
gest that the cytoplasmic tail of IBV S also binds the COPI 
coatomer complex (data not shown). The coatomer complex is 
composed of seven different subunits and is the major building 
block of COPI-coated vesicles. COPI vesicles mediate the ret¬ 
rograde transport of proteins from the Golgi to the ER (re¬ 
viewed in references 13 and 43) and perhaps other vesicle 
transport steps. While the COPI complex directly interacts 
with the dilysine signal, the efficiency of binding is influenced 
by residues surrounding the two lysines (51). For example, two 


phenylalanine residues immediately following a dilysine signal 
contribute to steady-state localization of ERGIC-53 to the 
ERGIC instead of the ER (1). We have not yet begun to 
explore the residues surrounding the dilysine signal in IBV S 
that influence its steady state localization. 

The role of other dibasic motifs in intracellular localization. 
Other types of signals have been found to mediate ER local¬ 
ization of cellular proteins. The Iip33 form of the invariant 
chain (a type II membrane protein) has two arginine residues 
near its N-terminal cytoplasmic tail that contribute to its ER 
localization (44). More recently, internal ER localization sig¬ 
nals (e.g., RXR) have been identified in certain oligomeric ion 
channels (59). For these dibasic ER localization signals, se¬ 
quence context contributes to efficiency of localization (58), as 
does their distance from the membrane (46). In addition, as 
shown for the dilysine motif (30), oligomerization of diargin¬ 
ine-containing subunits with other proteins can mask the rec¬ 
ognition of the localization signal, allowing transport to the 
plasma membrane in a regulated manner (38, 57, 59). Intra¬ 
cellular localization signals on viral proteins would effectively 
be masked by the assembly process, allowing exocytosis of 
budded virions. 

A novel intracellular localization motif. In the work de¬ 
scribed here, we found that a previously unknown motif 
(KXHXX) in group 1 coronaviruses and the SARS coronavirus 
can mimic the dilysine signal in IBV S, resulting in ERGIC 
localization. If the histidine residue is protonated, the 
KXHXX motif fits the criteria for a dibasic ER retrieval signal 
(51). Consistent with this, it was shown that a histidine residue 
at the —3 position of the oligosaccharide transferase subunit 
OST48 could substitute for the lysine residue normally present 
in this position (19). Sequence context is clearly important, 
since histidine could not replace the lysine at the —3 position 
in the adenovirus E3-19K protein (25). The lysine-histidine 
motif located at the carboxy terminus of TGEV is also present 
in other group 1 coronaviruses, including canine coronavirus, 
feline infectious peritonitis virus, and human coronavirus 
229E. When attached to a plasma membrane reporter protein, 
the C-terminal 11 amino acids of TGEV or SARS coronavirus 
S proteins promoted ERGIC localization. Efficient transport 
to the plasma membrane occurred when the lysine and histi¬ 
dine residues were mutagenized to alanines in the G-S sars or 
G-S tgi v chimera. 

Although dibasic signals are present in S proteins from 
groups 1 and 3 and the SARS coronavirus, there was no such 
motif in the final 11 amino acids of group 2 coronaviruses BCV 
and MHV. Interestingly, group 2 coronaviruses encode a hem¬ 
agglutinin esterase (HE) protein, although not all express it. 
HE is a type I membrane protein with arginine at the —5 
position and histidine at the —3 position from the C terminus 
(26). This RXHXX motif can function to sequester OST48 in 
the ER (19). BCV HE has been shown to oligomerize indi¬ 
rectly with S through its interaction with M (36). Thus, group 
2 coronaviruses that express HE might use a similar ER re¬ 
trieval pathway for localization of the S protein. 

Implications of ER retrieval signals for coronavirus S pro¬ 
tein function. One of the hallmarks of coronavirus infection of 
cultured cells is formation of syncytia, caused by cell-cell fusion 
induced by S protein expressed at the plasma membrane. How 
do our results impact this known activity of the coronavirus S 
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protein? We hypothesize that ER retrieval signals in conjunc¬ 
tion with slow folding of the lumenal domain in the ER effec¬ 
tively limit the level of surface S protein (39). At high expres¬ 
sion levels (as might occur late in coronavirus infection or with 
use of expression vectors, such as vaccinia virus), the S protein 
could saturate the ER retrieval machinery and be transported 
to the plasma membrane. Retention of S near the budding site 
would thus be regulated by the level of S expressed. In support 
of the idea that coronaviruses regulate the level of surface S 
protein, we found that IBV S contains an endocytosis signal 
upstream of the dilysine signal in its cytoplasmic tail. This 
internalization signal ensures that IBV S protein that reaches 
the plasma membrane does not remain there for long. Since S 
proteins from group 2 coronaviruses lack ER retrieval signals 
in their cytoplasmic domains, this group may depend on cell- 
to-cell spread of the infection by syncytium formation to a 
greater extent than other coronaviruses. Further experiments 
are needed to explore the importance of dibasic localization 
signals in assembly and pathogenesis of different coronavi¬ 
ruses. 
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